On Finding Frequent Patterns in Directed Acyclic Graphs

نویسندگان

  • Andrea Campagna
  • Rasmus Pagh
چکیده

Given a directed acyclic graph with labeled vertices, we consider the problem of finding the most common label sequences (“traces”) among all paths in the graph (of some maximum length m). Since the number of paths can be huge, we propose novel algorithms whose time complexity depends only on the size of the graph, and on the relative frequency ε of the most frequent traces. In addition, we apply techniques from streaming algorithms to achieve space usage that depends only on ε, and not on the number of distinct traces. The abstract problem considered models a variety of tasks concerning finding frequent patterns in event sequences. Our motivation comes from working with a data set of 2 million RFID readings from baggage trolleys at Copenhagen Airport. The question of finding frequent passenger movement patterns is mapped to the above problem. We report on experimental findings for this data set.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dagma: Mining Directed Acyclic Graphs

We present how to efficiently mine a set of directed acyclic graphs (DAGs) for unconnected, both multior single-rooted, and induced fragments. With a new canonical form that is based on the nodes' topological levels, our miner is faster and uses less storage than general purpose gSpan (Yan, X. and Han, J., 2002). Moreover, it can base support resp. frequency either on the number of embeddings o...

متن کامل

Mining Frequent Patterns from Human Interactions in Meetings Using Directed Acyclic Graphs

In modern life, interactions between human beings frequently occur in meetings, where topics are discussed. Semantic knowledge of meetings can be revealed by discovering interaction patterns from these meetings. An existing method mines interaction patterns from meetings using tree structures. However, such a tree-based method may not capture all kinds of triggering relations between interactio...

متن کامل

Improved algorithms for replacement paths problems in restricted graphs

We present near-optimal algorithms for two problems related to finding the replacement paths for edges with respect to shortest paths in sparse graphs. The problems essentially study how the shortest paths change as edges on the path fail, one at a time. Our technique improves the existing bounds for these problems on directed acyclic graphs, planar graphs, and non-planar integer-edge-weighted ...

متن کامل

A Calculus and Algebra for Querying Directed Acyclic Graphs

Resumen. In this paper we present a language for querying directed acyclic graph. The language is based on patterns, that is, grosso modo, on variable-free logic expressions. In order to be included in a broader calculus, the language allows the assignment to variables of portions of a graph that match sub-patterns. Finally, we show how the epressions in the language can be translated into a qu...

متن کامل

On Finding Minimum Deadly Sets for Directed Networks

Given a set S of elements in a directed network that are initially faulty, an element becomes (functionally) faulty if all its in-neighbors or all its outneighbors are (functionally) faulty. A set S of initially faulty elements is called deadly if it causes the entire network to become faulty according to the above rule. We show that finding a minimum deadly set is NP-hard for arbitrary directe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1005.0239  شماره 

صفحات  -

تاریخ انتشار 2010